A New Effective System for Filtering Pornography Images from Web Pages and PDF Files

Authors

  • Moheb R. Girgis
  • Tarek M. Mahmoud
  • Tarek Abd-El-Hafeez
Abstract

Image search is one of the most rapidly growing areas in search technology. As image search becomes widely available, so does the need to filter offensive content and keep pornographic images from reaching the wrong eyes. Filtering and blocking software is one of the most frequently touted prevention devices. As any user of these services is aware, they often fail to remove offensive images. The reason is that “current Internet image search technology is based upon words, rather than image content”: images are retrieved by their filename or by the text that surrounds them on a webpage. This paper presents an automatic software system that uses skin recognition to detect and filter pornographic images of any format (JPG, PNG, etc.) in web pages. The proposed system is an online client-side filtering system that allows the user to choose skin detection filtering, keyword filtering, URL filtering, domain filtering, or a combination of two or more of these techniques. A hybrid skin color detection technique is proposed to overcome detection failures on images with complex backgrounds. To the best of our knowledge, no existing tool blocks pornographic images embedded in PDF documents, so we augmented the proposed system with online and offline filtering of PDF documents based on skin color detection. The experimental results showed that the proposed filtering system is more effective and accurate than some of the established commercial filtering software.
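The abstract does not specify the hybrid skin color detector, so the following is only a minimal sketch of pixel-based skin filtering under stated assumptions. It substitutes a widely used explicit RGB threshold rule for the authors' technique, and the function names (`is_skin_rgb`, `skin_ratio`, `should_block`) and the 0.4 blocking threshold are hypothetical choices made for illustration.

```python
from typing import Iterable, Tuple

Pixel = Tuple[int, int, int]  # (R, G, B), each channel in 0..255


def is_skin_rgb(pixel: Pixel) -> bool:
    """Explicit RGB skin rule (uniform-daylight variant).

    Assumption: this stands in for the paper's hybrid detector,
    which the abstract does not describe.
    """
    r, g, b = pixel
    return (
        r > 95 and g > 40 and b > 20
        and max(r, g, b) - min(r, g, b) > 15  # enough color spread
        and abs(r - g) > 15                   # red clearly differs from green
        and r > g and r > b                   # red is the dominant channel
    )


def skin_ratio(pixels: Iterable[Pixel]) -> float:
    """Fraction of pixels classified as skin (0.0 for an empty image)."""
    pixels = list(pixels)
    if not pixels:
        return 0.0
    return sum(is_skin_rgb(p) for p in pixels) / len(pixels)


def should_block(pixels: Iterable[Pixel], threshold: float = 0.4) -> bool:
    """Flag an image when its skin ratio reaches a hypothetical threshold."""
    return skin_ratio(pixels) >= threshold
```

A rule this simple treats any skin-toned region as skin, including wood, sand, or a warm-colored wall, which is the kind of complex-background failure the abstract says the hybrid technique is meant to address. In a combined filter, a check like `should_block` would run alongside the keyword, URL, and domain checks the user has enabled.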

Similar resources

Use of Semantic Similarity and Web Usage Mining to Alleviate the Drawbacks of User-Based Collaborative Filtering Recommender Systems

One of the best-known recommendation methods is user-based Collaborative Filtering (CF). This system compares the active user’s item ratings with the historical rating records of other users to find similar users, and recommends items that seem interesting to these similar users and have not been rated by the active user. As a way of computing recommendations, the ultimate goal of the user-ba...

Pixel-Based Skin Detection for Pornography Filtering

A robust skin detector is the primary need of many fields of computer vision, including face detection, gesture recognition, and pornography filtering. Less than 10 years ago, the first paper on automatic pornography filtering was published. Since then, different researchers claim different color spaces to be the best choice for skin detection in pornography filtering. Unfortunately, no com...

Clasificador de Páginas Web Pornográficas Basado en el Contenido de las Imágenes

The World Wide Web, or web, is an information access and search logic system available on the Internet whose informative units are web pages. The web has facilitated the publication of a large amount of information accessible from anywhere in the world; however, part of this content, such as pornography, is regarded as inappropriate for some users. To contribute to the pornography filtering on the web, this...

PDFlib, PDFlib+PDI, Personalization Server data sheet

What is PDFlib? PDFlib is the leading developer toolbox for generating and manipulating files in the Portable Document Format (PDF). PDFlib’s main targets are dynamic PDF creation on a Web server or any other server system, and to implement »Save as PDF« in existing applications. You can use PDFlib to dynamically create PDF documents from database contents, similar to dynamic Web pages. PDFlib ...

Pornography Web Pages Classification with Textual Content Analysis Using Entropy Term Weighting Scheme for Small Class Dataset

The fast growth of the internet makes objectionable web content such as pornography and violence easy for web users, especially children and teenagers, to explore. Because some popular web filtering techniques, like Uniform Resource Locator blocking and Platform for Internet Content Selection checking, are limited against today’s dynamic web content, content based analysis techniques with effective mod...

Journal title:
  • IJWA

Volume 2  Issue 

Pages  -

Publication date 2010